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AN ANTIGEN FOR DEVELOPING NEUTRALIZING 
ANTIBODIES TO HUMAN IMMUNODEFICIENCY VIRUS 



This application claims priority from U.S. Provisional Application, Serial No. 
60/210,322, filed June 8, 2000, which is hereby incorporated by reference. 

5 The invention described herein was made under a grant from the National Institutes of 

Health, Grant No. AI42382-01 Al . The Government may have certain rights in the invention. 

BACKGROUND OF THE INVENTION 

Infection of cells by HIV-1 is initiated by fusion of the viral and cellular membranes, 
a process mediated by the envelope glycoprotein on the surface of the virion. The HIV-1 

10 envelope glycoprotein, gpl60, is posttranslationally cleaved into two noncovalently 

associated subunits, gpl20 and gp41 (7). The envelope glycoprotein complex consists of a 
trimer of three gpl20 surface subunits and three gp41 transmembrane subunits. Analysis of 
its structure by X-ray crystallography shows that the NH 2 -terminal region of the ectodomain 
of gp41 forms a central, three-stranded coiled coil that is wrapped by an outer layer of three 

15 COOH-terminal helices in an antiparallel orientation around the outside of the coiled coil (6). 

Each gp41 chain is anchored at its COOH terminus in the viral membrane. gpl20 
determines viral tropism by binding to the cellular receptors, CD4 and members of the 
chemokine receptor family, whereas gp41 is responsible for fusing the viral and host cell 
membranes (2). There is substantial evidence to indicate that receptor binding triggers 
20 conformational changes in the gpl20/gp41 complex, leading to activation of gp41 membrane 
fusion properties and ultimately invasion of the viral genome (3). By analogy with the 
'spring-loaded' model of the influenza virus hemagglutinin (4), gp41 activation is postulated 
to involve a complex set of structural changes from a native (prefusogenic) state to a fusion- 
active (fusogenic) conformation (5). 

25 Because the membrane anchor and the NH 2 -terminal fusion peptide of gp41 are 

embedded in the viral and target cell membranes respectively, formation of the trimer-of- 
hairpins structure is thought to appose two membranes for fusion (7). A monoclonal antibody 



recognizing this gp41 core binds specifically to the surface of HIV- 1 infected cells only after 
addition of soluble CD4 (8). This observation provides direct evidence that the trimer-of- 
hairpins structure represents the fusion-active state of gp41 (6). The structure of the HIV-1 
envelope glycoprotein in its native conformation is unknown. 

5 The viral envelope glycoprotein that mediates HIV-1 entry into target cells is an 

important target for humoral immunity during the natural course of infection (30). Current 
recombinant gpl20 or gp41 protein vaccine candidates are unable to elicit antibodies capable 
of neutralizing primary HIV-1 isolates from infected persons at significant titers (34). These 
antibodies bind well to the individual gpl20 and gp41 subunits but poorly to the membrane- 

10 associated, native envelope glycoprotein complex (Fig. 1) (35). In contrast, three infected 
human-derived neutralizing antibodies have been shown to recognize the native gpl20/gp41 
complex efficiently (10). Thus, it appears that an immune response to virions, rather than to 
normative forms of the envelope glycoprotein (called viral debris), leads to the production of 
functional antibodies. Since gpl20 is readily dissociated or shed from gp41 (37), the 

15 challenge of inducing protective humoral immunity is to preserve the native envelope 
structure, or crucial components thereof, in vaccine preparations. 

Because inhibition of gp41 -mediated membrane fusion has potential to offer a general 
strategy for the treatment or prevention of HIV-1 infection (9), structural information on the 
conformational change of the protein is crucial to guiding efforts to target this process for 

20 vaccine and antiviral drug development. In accordance with the present invention, it has been 
discovered that a peptide fragment derived from the COOH-terminal region of the gp41 
ectodomain, forms a parallel three-stranded, a-helical coiled coil and serves as a trimerization 
domain in the native gpl20/gp41 complex. A stable form of the trimeric coiled-coil domain 
elicits a strong antiviral antibody response against HIV-1 primary isolates from AIDS patients 

25 and thus has utility as a safe and practical HIV-1 vaccine immunogen. 
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SUMMARY OF THE INVENTION 

The present invention provides polypeptides comprising a stabilized HIV gp41 trimer. 
A subject polypeptide comprises three gp41 monomers that form a trimeric coiled coil, in a 
prefusogenic conformation. 

5 In one embodiment of the invention, the trimer is stabilized by using an isoleucine 

zipper. In another embodiment of the invention, the trimer is stabilized by chemical cross- 
linking of gp41 monomers. In yet another embodiment of the invention, the trimer is 
stabilized by point mutations in gp41 monomers. The subject polypeptides may be 
synthetically or recombinantly produced. 

10 In another aspect of the invention, there is provided a vaccine for the prevention or 

treatment of HIV infection or AIDS. The vaccine comprises a stabilized HIV gp41 trimer, 
comprising three gp41 monomers that form a trimeric coiled coil, in a prefusogenic 
conformation. The vaccine may be synthetically or recombinantly produced and may also 
comprise a suitable adjuvant. 

15 In still another aspect of the invention, there is provided a method of vaccinating an 

individual to prevent or treat HIV infection or AIDS. The method comprises administering to 
the individual an immunogenically effective amount of a composition comprising a subj ect 
vaccine. A subject vaccine may be administered in combination with a physiologically 
acceptable carrier or diluent. 

20 Also provided by the present invention is an antibody or binding portion thereof 

raised against a C terminus trimeric coiled coil motif of an HIV gp41 ectodomain. The 
antibody or binding portion thereof may be monoclonal or polyclonal. 

In a further aspect of the invention, there is provided a method of detecting HIV in a 
sample. The method comprises the steps of exposing a sample to a subject antibody or 
25 binding portion thereof, and identifying binding between HIV and the antibody or binding 
portion. 



A method of making a subject antibody or binding portion thereof is also provided. 
The method comprises exposing an antibody-producing cell to an isolated HIV gp41 trimer 
comprising three gp41 monomers that form a trimeric coiled coil, in an inactive 
conformation, under conditions effective to produce the antibody or binding portion thereof. 
Examples of antibody-producing cells for use in the method include e.g. cultured hybridoma 
cells and cells present in a living organism. 

A method of inhibiting HIV infectivity is also provided. The method comprises 
contacting HIV with a suitable amount of a subject antibody or binding portion thereof under 
conditions effective to inhibit HIV infectivity. A subject antibody or binding portion thereof 
may be administered to a host before or following exposure of the host to an infective HIV 
strain. 

In yet another aspect of the present invention, a method of screening for drugs which 
inhibit HIV infection is provided. The method comprises the steps of providing a gp41 trimer 
comprising three gp41 monomers that form a trimeric coiled coil in a prefusogenic 
conformation; exposing the trimer to a compound; and identifying those compounds which 
bind to the gp41 trimer. Preferably, a determination is made as to whether gp41 bound to the 
compound binds less effectively to gpl20 than gp41 not bound to the compound. 

In addition to HIV, the present invention is also directed to polypeptides comprising a 
stabilized viral envelope protein from other enveloped viruses. Thus, there are provided 
polypeptides comprising a stabilized viral envelope protein trimer wherein the trimer 
comprises a C-terminal heptad repeat region of the ectodomain in a prefusogenic 
conformation. Examples of other enveloped viruses include but are not limited to SIV, Mo- 
MLV, influenza virus, and Ebola virus. 

The present invention also provides vaccines comprising such stabilized viral 
envelope protein trimers, methods of vaccinating an individual to prevent or treat infection by 
an enveloped virus, antibodies (or binding portions thereof) raised against polypeptides 
comprising a stabilized viral envelope protein trimer, methods of making such antibodies or 
binding portions thereof, methods of inhibiting infectivity of an enveloped virus, and methods 
for screening for drugs which can inhibit infection by an enveloped virus. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 illustrates the trimer-of-hairpins structure of fusion-active gp41. The axial 
and side views of the gp41 core structure are shown (6). The NH 2 -terminal helices (blue) 
form an interior trimeric coiled coil, surrounded by an outer layer of three antiparallel COOH- 
5 terminal helices (red). This molecular arrangement is thought to force the viral and cellular 
membranes close together and thus overcome the energy barrier for membrane fusion (7). 

Figure 2A is a schematic view of gp41. The locations of the NH 2 - and COOH- 
terminal hydrophobic heptad repeats are indicated. The disulfide bond and four potential N- 
glycosylation sites are depicted. 

10 Figure 2B schematically depicts a chimera of gp41 (residues 536 to 666) and the 

GCN4-pII peptide (14). The amino acid sequence of the COOH-terminal extension of gp41 
is shown in an expanded view. Residues in the a and d positions of gp41 residues, and 
GCN4-pII in heptad register, are indicated. A continuous helix is assumed between the 
gp41 coiled coil and GCN4-pII. The locations of serine substitutions for He 573 and Leu 576 in 

1 5 the NH 2 -terminal heptad-repeat region are indicated by arrows. 

Figure 2C is a photographic reproduction of an Coomassie blue stained SDS- 
polyacrylamide gel run with the gp41/GCN4-pII chimera, digested with trypsin for the times 
indicated. The sizes of protein molecular weight markers (in kDa) are indicated at the left. 

Figure 3A shows the helical wheel projection of C45-pII. View is from the NH 2 
20 terminus. The amino acid sequence of C45 (residues 624 to 668) is shown in red. The 
residues in black are from GCN4-pII. The residue in blue corresponds to a conservative 
leucine substitution for Met 629 , made to facilitate protein production. 

Figure 3B shows a CD spectrum of a 10 uM solution of C45-pII at 0°C in PBS (pH 

7.0). 



25 



Figure 3C shows sedimentation equilibrium data (17,000 rpm) of C45-pII at —10 uM 
at 4°C in PBS (pH 7.0). The natural logarithm of the absorbance at 234 nm is plotted against 



the square of the radial position. The random distributions of the residuals indicate that the 
data fit well to an ideal single-species model. 

Figure 4A shows the amino acid sequence of C52 with the a and d positions of the 
heptad repeat indicated above the sequence. The sequence of the C45 peptide is also 
indicated. 

Figure 4B shows the CD spectra of C52 (circles) and C45 (squares) at 0°C in PBS 
(pH 7.0) at 100 uM peptide concentration. 

Figure 4C shows sedimentation equilibrium data (19,000 rpm) of C52 at -30 uM at 
4°C in PBS (pH 7.0). The natural logarithm of the absorbance at 236 nm is plotted against 
the square of the radial position. The random distributions of the residuals indicate that the 
data fit well to an ideal single- species model. 

Figure 5 A schematically depicts the trimeric gpl20/gp41 complex in the native state. 
The COOH-terminal coiled coil (red) in gp41 is present, while the NH 2 -terminal heptad- 
repeat region (blue) and fusion peptide (brown) are involved in interaction with the gpl20 
subunit (green balls). 

Figure 5B schematically depicts the trimeric gpl20/gp41 complex following binding 
of gpl20 to cell- surface receptors, where gp41 undergoes a conformational change to the 
transient prehairpin intermediate, and the NH 2 -terminal coiled coil is formed. As a result, the 
fusion peptide region inserts into the cellular membrane at the top of the molecule. 

Figure 5C shows the trimeric gpl20/gp41 in the fusogenic state, where the COOH- 
terminal helices are packed in an antiparallel manner against the NH 2 -terminal coiled coil, 
advancing two lipid bilayers toward intimate approximation and fusion. Formation of the 
stable trimer-of-hairpins structure is likely to provide a ready source of the activation energy 
needed for this membrane apposition process. 

Figure 6A are the CD spectra showing folding of the C52 mutant peptides Q652L 
(triangles) and N656L (rhombi) at 0°C in PBS (pH 7.0) at 100 uM peptide concentration. 
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Figure 6B are thermal melts of Q652L (triangles) and N656L (rhombi) in PBS (pH 
7.0) at 100 uM peptide concentration. 

Figure 6C shows sedimentation equilibrium data (19,000 rpm) for Q652L (triangles) 
at -100 uM and N656L (rhombi) at -30 uM collected at 4°C in PBS (pH 7.0). For an ideal, 
5 single-species system, the plot of the natural logarithm of the absorbance against the square 
of the radial position is linear, with the slope proportional the molecular weight of the 
molecule (18). In a multi-species system, however, the plot becomes curved upward (18). 

Figure 7A illustrates the chimera of the C52 peptide (residues 624 to 675) and an 
isoleucine-zipper trimer (GCN4-pII). Met629 in the C52-pII molecule was mutated to 
10 leucine to facilitate protein production. The location of the C45 segment (residues 624 to 

668) is indicated. The gp41 residues are numbered according to their position in gpl60 of the 
HXB2 HIV-1 strain. 

Figure 7B is a helical wheel representation of C52-pII. The sequence of C52 is shown 
in red, while the residues in black are from GCN4-pII. A continuous helix is assumed 
1 5 between the C52 coiled coil and GCN4-pII. View is from the NH2 terminus. 

Figure 7C is a CD spectrum of a 5 mM solution of C52-pII at 4°C in PBS (pH 7.0). 

Figure 7D is a thermal melt of C52-pII monitored by CD at 222 ran in the presence of 
4 M GdmCl, a denaturant. 

Figure 7E shows sedimentation equilibrium data (16,000 rpm) of C52-pII at -50 mM 
20 at 20°C in PBS (pH 7.0). The natural logarithm of the absorbance at 254 nm is plotted 

against the square of the radial position. The random distributions of the residuals indicate 
that the data fit well to an ideal single-species model. The calculated data for dimeric and 
tetrameric models are indicated by dashed and solid lines, respectively. 

Figure 8A shows the frequency of occurrence of amino acids at positions 669 to 675 
25 of gpl60. The sequences of 213 fully sequenced M group HIV-1 strains (HIV Sequence 

Database [1998/1999 alignments], Los Alamos National Laboratory, http:hiv-web.lanl.gov) 



-8- 

were analyzed. The amino acids occurring at positions 669 to 675 are shown with the 
number of times they occur at these positions. Periods indicate stop codons. 



DETAILED DESCRIPTION OF THE INVENTION 

In accordance with the present invention, it has been discovered that favorable 

5 presentation of the gp4 1 COOH-terminal coiled-coil region of HIV is important for inducing 
antibodies that neutralize representative primary HIV-1 isolates in primary cell cultures. The 
present invention therefore overcomes three major obstacles in ongoing HIV-1 vaccine 
development. In the first instance, a natural infection with HIV-1 stimulates only weak 
functional antibody responses, most likely because the trimeric envelope glycoprotein spikes 

1 0 are poorly immunogenic (30). In accordance with the present invention, a stable trimerization 
domain of the native gpl20/gp41 complex acts as an immunogen that provokes a highly 
focused and strong humoral immune response. In addition, antibodies produced in natural 
infection or typical vaccination protocols bind weakly to primary clinical isolates and are 
therefore of limited efficacy in controlling the virus (55). Through immunization with a 

1 5 stable form of a surface-exposed gp4 1 coiled coil, provided by the present invention, 

neutralizing antibodies have been generated which bind to primary viruses. Further, the 
ability to constantly select for viral variants is a defense mechanism by which HIV-1 resists 
or evades humoral immunity after either natural or experimental infection or envelope-based 
vaccination. In accordance with the present invention, the preservation of a well-conserved 

20 envelope protein structure in vaccine candidates as described herein is useful in surmounting 
the barrier of viral antigenic variation and thus preventing an initial HIV-1 infection. 

The ectodomain core structures of the membrane fusion subunits from many 
enveloped viruses, including but not limited to HIV-1, HIV-2, SIV, Mo-MLV, influenza 
virus, and Ebola virus, are remarkably similar (reviewed in Chan & Kim, 1999; Skehel & 
25 Wiley, 1998). In all cases, an interior, trimeric coiled coil (formed by the N-terminal heptad- 
repeat regions) is surrounded by helices derived from the C-terminal end of the ectodomains, 
packed in an antiparallel manner around the outside of the coiled coil (see Figure 1). A large 
body of evidence suggests that viral fusion proteins undergo a similar conformational change 
to become fusion active (Chan, D.C. & Kim, P.S., 1988 "HIV entry and its inhibition," Cell 



93: 681-684; Skehel, J.J. & Wiley, D.C. , 1998 "Coiled coils in both intracellular vesicle and 
viral membrane fusion," Cell 95: 871-874). Thus, the flip-flop molecular mechanism for 
activation and refolding of the viral membrane fusion protein is likely to be a general 
mechanism of infection by enveloped viruses. The C-terminal heptad-repeat regions of other 
5 viral fusion proteins such as those found in SIV, Mo-MLV, and Ebola virus also likely form 
trimerization and folding domains in their prefusogenic conformations. 

In a first aspect of the invention, there is provided a polypeptide comprising a 
stabilized viral envelope protein trimer. The protein trimer comprises a C-terminal heptad 
repeat region of the ectodomain in a prefusogenic conformation. Examples of enveloped 

=.10 viruses from which a subject polypeptide may be derived include, but are not limited to HIV- 

5 1 , HI V-2, SIV, Mo-MLV, influenza virus, and Ebola virus. 

In a preferred embodiment, a subject polypeptide comprises a stabilized HIV gp41 
trimer comprising three gp41 monomers that form a trimeric coiled coil in a prefusogenic 
conformation. In accordance with the present invention, such a polypeptide may be obtained 
15 by isolating and stabilizing the carboxyl-terminal core region of the gp41 ectodomain. The 
terms "carboxyl-terminal core region of the gp41 ectodomain" and "C-terminus trimeric 
coiled coil motif of an HIV gp41 ectodomain" are used interchangeably herein. The terms 
"prefusogenic" and "native" are also used interchangeably herein in describing a subject 
stabilized HIV gp4 1 trimer or other viral envelope protein trimer. 

20 Conventional recombinant schemes may be used to obtain an isolated polypeptide 

comprising a stabilized viral envelope protein trimer, wherein the trimer comprises a C- 
terminal heptad repeat region of the ectodomain in a prefusogenic conformation. The nucleic 
acid and amino acid sequences of different envelope proteins are known. For example, the 
nucleic acid and amino acid sequences for HIV-1, HI V-2, and SIV may be found in the HIV 

25 Sequence Database, Human Retrovirus and AIDS, 2000 ed., Los Alamos National 

Laboratory. The nucleotide and amino acid sequences for Mo-MLV, influenza virus, and 
Ebola virus, as well as other enveloped viruses, are available through Medline 
(www.ncbi.nlm.nih.gov/pubmed). These disclosures, as well as all other published references 
cited herein, are incorporated by reference as if fully set forth. The C-terminal heptad repeat 
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region of the ectodomain of enveloped viruses is easily discernible from the published 
sequences. 

Similarly, conventional recombinant schemes may be used to obtain the isolated HIV 
gp41 trimer. In addition to the HIV Sequence Database and Medline, the complete nucleotide 
5 sequences of the HIV viruses had been previously reported by several investigators (Ratner et 
al., "Complete Nucleotide Sequence of the AIDS virus, HTLV-III," Nature 3 13:277-84 
(1985); Muesing et al., "Nucleic Acid Structure and Expression of the Human 
AIDS/lymphadenopathy Retrovirus," Nature 313:450-458 (1985); Wain-Hobson et al., 
"Nucleotide Sequence of the AIDS Virus, LAV," Cell 40:9-17 (1985), which are hereby 

10 incorporated by reference). The RNA of the HIV-1 and HIV-2 viruses possess the following 
gene regions: so-called long terminal repeats at each end of the genome, gag,pol, env, and 
nef. The gag gene encodes the core proteins p24 and pi 7. The pol gene encodes the reverse 
transcriptase, RNAse H, and integrase. The gene nef encodes a protein having a regulatory 
function. The env gene encodes the glycoproteins of the viral coat, namely gp41 and gpl20. 

1 5 The env gene has been associated particularly with antigenicity and infectivity . 

Given the knowledge of nucleic acid sequences encoding HIV, as well as other 
enveloped viruses, one of ordinary skill in the art can readily insert coding sequence for the 
gp41 glycoprotein or other viral envelope trimer comprising a C-terminal heptad repeat 
region of the ectodomain in a prefusogenic conformation, directly into a host cell or into an 
20 intermediate expression vector. Coding sequences corresponding to the HIV gp41 

glycoprotein are readily available and may be obtained e.g., from the NIH AIDS Research 
and Reference Reagent Program Catalogue, Rockville, MD. Coding sequences for other 
enveloped viruses may be obtained from sources such as the American Type Culture 
Collection (ATCC), Manassas, Virginia. 

25 Various truncated fragments of the nucleotide sequence encoding the gp41 

glycoprotein may be used in accordance with the present invention. For example, coding 
sequence corresponding to amino acid residues 536-666, 640-658, or 624-669 (numbered 
according to their position in gpl60, the HXB2 HIV-1 strain) may be used. Fragments of the 
gp41 glycoprotein may be obtained by truncating the 5', 3', or both 5' and 3' ends of the 
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gp41 coding sequence by genetic manipulation. In vitro mutagenesis is helpful for 
introducing restriction sites. Similarly, truncated fragments of other enveloped virus C- 
terminal heptad repeat regions of the ectodomain may be used. There are various 
commercially available kits particularly suited for this application such as the T7-Gene in 
5 vitro Mutagenesis Kit (USB, Cleveland, OH) and the QuickChange Site Directed 

Mutagenesis Kit (Stratagene, San Diego, CA). Alternatively, PCR primers can be derived to 
allow direct amplification of a particular portion of a gp41 coding sequence or C-terminal 
heptad repeat region. 

The carboxyl-terminal core region of the gp41 ectodomain (C-terminus trimeric coiled 
10 coil motif of an HIV gp41 ectodomain) may be stabilized by e.g., insertion of an isoleucine 
zipper. The C-terminal heptad-repeat regions of the ectodomain of other enveloped viruses 
may also be stabilized by insertion of an isoleucine zipper. Thus, for example, an isoleucine 
zipper described in P.B. Harbury, et al. (1994) Nature 371 :80, may be inserted near the C- 
terminus of the coiled coil segment. Preferably, the isoleucine zipper is inserted in heptad 
15 register and before residue 684 in HIV (residues numbered according to their position in gp 
160, the HXB2 HIV-1 strain), which residue immediately precedes the transmembrane 
anchor. An isoleucine zipper may be inserted anywhere between residues 670 and 684 in 
HIV. Alternatively, an isoleucine zipper may be placed near the N-terminus of the C- 
terminal trimeric coiled coil of an HIV ectodomain or near the N-terminus of a C-terminal 
20 heptad-repeat region of the ectodomain of other enveloped viruses. 

The carboxyl-terminal core region of the gp41 ectodomain may also be stabilized by 
cross linking of gp41 monomers. Similarly, the C-terminal heptad repeat regions of the 
ectodomain of other viral envelope proteins may be stabilized by cross linking. Such cross 
linking can be accomplished using well known methods such as those described in Houston, 
25 M.E., Jr. et al., (1995) J. Peptide Science 7:274-282. 

The carboxyl-terminal core region of the gp41 ectodomain may also be stabilized by 
introducing one or point mutations in one or more gp41 monomers. Thus for example, Ser- 
649, Gln-652, Asn-656, and Glu-659 (numbered according to their position in gpl60 in the 
HXB2 HIV-1 strain) may be mutated to the hydrophobic residue isoleucine. Methods of 
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inducing point mutations are well known and include e.g., single stranded mutagenesis as 
described in T.A. Kunkel, et al., Methods Enzymol. 154, 367 (1987). The C-terminal heptad 
repeat region of the ectodomain of other enveloped viruses may also be stabilized by 
introducing one or more point mutations therein. Any and all possible mutations are 
5 contemplated by the present invention as long as the mutation does not destabilize the HIV 
trimer comprising three gp41 monomers which form a trimeric coiled coil in a prefusogenic 
conformation or does not destabilize a trimer comprising a C-terminal heptad-repeat region of 
the ectodomain in a prefusogenic conformation, of another enveloped virus. 

Various expression vectors are known in the art, including plasmids, viral vectors, and 
10 bacteriophage vectors. The host cell to be transformed with the nucleic acid molecule may be 
most amenable to transformation with a particular type of expression vector. Plasmid vectors 
readily transform bacterial host cells. Viral vectors readily transform many mammalian cells. 

A nucleic acid molecule coding for a subject polypeptide may be inserted into an 
expression vector using standard cloning procedures readily known in the art. This generally 

15 involves the use of restriction enzymes and DNA ligases, as described by Maniatis et al. 
(1982). U.S. Patent No. 4,237,224 to Cohen and Boyer, which is hereby incorporated by 
reference, describes the production of expression systems in the form of recombinant 
plasmids using restriction enzyme cleavage and ligation with DNA ligase. These 
recombinant plasmids are then introduced by means of transformation and replicated in 

20 unicellular cultures including prokaryotic organisms and eukaryotic cells grown in tissue 
culture. 

Suitable vectors include, but are not limited to, the following viral vectors such as 
lambda vector system gtl 1, gt WES.tB, Charon 4, and plasmid vectors such as pBR322, 
pBR325, P ACYC177, pACYC1084, pUC8, pUC9, pUC18, pUC19, pLG339, pR290, P KC37, 
25 pKClOl, SV 40, pBluescript II SK +/- or KS +/- (see "Stratagene Cloning Systems" Catalog 
(1993) from Stratagene, La Jolla, Calif, which is hereby incorporated by reference), pQE, 
pIH821, pGEX, pET series (see F.W. Studier et. al., "Use of T7 RNA Polymerase to Direct 
Expression of Cloned Genes," Gene Expression Technology vol. 185 (1990), which is hereby 
incorporated by reference), and any derivatives thereof. Recombinant molecules can be 
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introduced into cells via transformation, particularly transduction, conjugation, mobilization, 
or electroporation. The DNA sequences are cloned into the vector using standard cloning 
procedures in the art, as described by Sambrook et al, Molecular Cloning: A Laboratory 
Manual, Cold Springs Laboratory, Cold Springs Harbor, New York (1989), which is hereby 
5 incorporated by reference. 

A variety of host- vector systems may be utilized to express the protein-encoding 
sequence(s). Preferred vectors include a viral vector, plasmid, cosmid or an oligonucleotide. 
Primarily, the vector system must be compatible with the host cell used. Host-vector systems 
include but are not limited to the following: bacteria transformed with bacteriophage DNA, 

10 plasmid DNA, or cosmid DNA; microorganisms such as yeast containing yeast vectors; 

mammalian cell systems infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell 
systems infected with virus (e.g., baculovirus); and plant cells infected by bacteria. The 
expression elements of these vectors vary in their strength and specificities. Depending upon 
the host- vector system utilized, any one of a number of suitable transcription and translation 

1 5 elements can be used. 

Different genetic signals and processing events control many levels of gene 
expression (e.g., DNA transcription and messenger RNA (mRNA) translation). Transcription 
of DNA is dependent upon the presence of a promotor which is a DNA sequence that directs 
the binding of RNA polymerase and thereby promotes mRNA synthesis. The DNA 
20 sequences of eukaryotic promoters differ from those of prokaryotic promoters. Furthermore, 
eukaryotic promoters and accompanying genetic signals may not be recognized in or may not 
function in a prokaryotic system, and, further, prokaryotic promoters are not recognized and 
do not function in eukaryotic cells. 

Similarly, translation of mRNA depends upon the presence of the proper signals. 
25 Efficient translation of mRNA requires a ribosome binding site. This sequence is a short 
nucleotide sequence of mRNA that is located before the start codon, usually AUG, which 
encodes the amino-terminal methionine of the protein. The ribosome binding sites are 
complementary to the 3 '-end of the 16S rRNA (ribosomal RNA) and probably promote 
binding of mRNA to ribosomes by duplexing with the rRNA to allow correct positioning of 



- 14- 

the ribosome. For a review on maximizing gene expression, see Roberts and Lauer, Methods 
in Enzymology, 68:473 (1979), which is hereby incorporated by reference. 

A preferred embodiment of the invention is where the gene is functionally linked to a 
promoter. Promoters vary in their "strength" (i.e. their ability to promote transcription). For 
5 the purposes of expressing a cloned gene, it is desirable to use strong promoters in order to 
obtain a high level of transcription and, hence, expression of the gene. Depending upon the 
host cell system utilized, any one of a number of suitable promoters may be used. For 
instance, when cloning in E. coli, its bacteriophages, or plasmids, promoters such as the T7 
phage promoter, lac promoter, trp promoter, recA promoter, ribosomal RNA promoter, the 
10 PR and PL promoters of coliphage lambda and others, including but not limited, to lacUV5, 
ompF, bla, lpp, and the like, may be used to direct high levels of transcription of adjacent 
DNA segments. Additionally, a hybrid trp-lacUV5 (tac) promoter or other E. coli promoters 
produced by recombinant DNA or other synthetic DNA techniques may be used to provide 
for transcription of the inserted gene. 

1 5 Once the isolated nucleic acid molecule encoding the HIV gp41 glycoprotein or C- 

terminal heptad repeat of another enveloped virus, has been cloned into an expression system, 
it is ready to be incorporated into a host cell. Such incorporation can be carried out by the 
various conventional forms of transformation, including those dependent upon the vector/host 
cell system. Suitable host cells include, but are not limited to, bacteria, virus, yeast, 

20 mammalian cells, insect, and the like. 

Biological markers can be used to identify the cells carrying recombinant DNA 
molecules. In bacteria, these are commonly drug-resistance genes. Drug resistance is used to 
select bacteria that have taken up cloned DNA from the much larger population of bacteria 
that have not. In the early mammalian gene transfer experiments involving viral genes, the 
25 transfer of exogenous DNA into cells was detected because the DNA had a biological 

activity, it led to production of infectious virus or produced stable changes in the growth 
properties of the transfected cells. It was then discovered that the DNA tumor virus, herpes 
simplex virus (HSV), contained a gene encoding the enzyme thymidine kinase (the tk gene). 
The HSV tk gene can be used as a selectable genetic marker in mammalian cells in much the 
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same way that drug-resistance genes worked in bacteria, to allow rare transfected cells to 
grow up out of a much larger population that did not take up any DNA. The cells are 
transferred to selective growth medium, which permits growth only of cells that took up a 
functional tk gene (and the transferred DNA of interest). Various dominant selectable 
5 markers are now known in the art, including: aminoglycoside phosphotransferase (APH), 

using the drug G418 for selection which inhibits protein synthesis; the APH inactivates G418; 
dihydrofolate reductase (DHFR):Mtx-resistant variant, using the drug methotrexate (Mtx) for 
selection which inhibits DHFR; the variant DHFR is resistant to Mtx; hygromycin-B- 
phosphotransferase (HPH), using the drag hygromycin-B which inhibits protein synthesis; the 

10 HPH inactivates hygromycin B; thymidine kinase (TK), using the drug aminopterin which 

inhibits de novo purine and thymidylate synthesis; the TK synthesizes thymidylate; xanthine- 
guanine phosphoribosyltransferase (XGPRT), using the drug mycophenolic acid which 
inhibits de novo GMP synthesis; XGPRT synthesizes GMP from xanthine; and adenosine 
deaminase (ADA), using the drug 9-b-D-xylofuranosyl adenine (Xyl-A) which damages 

1 5 DNA; the ADA inactivates Xyl-A. 

In addition to the recombinant DNA techniques described above for preparing a 
carboxy 1-terminal core region of the gp4 1 ectodomain (C-terminus trimeric coiled coil motif 
of an HIV gp41 ectodomain), or C-terminal heptad repeat region of the ectodomain of another 
enveloped virus, such proteins may be synthesized using any number of well known 

20 techniques. For example, a carboxyl-terminal core region of the gp41 ectodomain may be 

prepared using the solid-phase synthetic technique initially described by Merrifield, in J. Am. 
Chem. Soc. 55:2149-2154 (1963). Other peptide synthesis techniques may be found in M. 
Bodanszky et al. Peptide Synthesis, John Wiley & Sons, 2d Ed. (1976) and other references 
readily available to skilled artisans. A summary of polypeptide synthesis techniques can be 

25 found in J. Stuart and J. D. Young. Solid Phase Peptide Synthesis, Pierce Chemical 

Company, Rockford, 111., (1984). Peptides may also be synthesized by solution methods as 
described in The Proteins, Vol. II, 3d Ed., Neurath, H. et. al., Eds., p. 105-236, Academin 
Press, New York, N. Y. (1976). Appropriate protective groups for use in different peptide 
syntheses are described in the above-mentioned texts as well as in J. F. W. McOmie, 

30 Protective Groups in Organic Chemistry, Pelenu Press, New York, N.Y. (1973). 
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A carboxyl-terminal core region of the gp4 1 ectodomain (C-terminus trimeric coiled 
coil motif of an HIV gp41 ectodomain) or C-terminal heptad repeat region of the ectodomain 
of another enveloped virus, may also be prepared by chemical or enzymatic cleavage from 
larger portions of the corresponding protein. 

5 Purified protein may be obtained by several methods. The protein or polypeptide of 

the present invention is preferably produced in purified form (preferably at least about 80%, 
more preferably 90%, pure) by conventional techniques. Typically, the protein or 
polypeptide of the present invention is secreted into the growth medium of recombinant host 
cells. Alternatively, the protein or polypeptide of the present invention is produced but not 

10 secreted into growth medium. In such cases, to isolate the protein, the host cell carrying a 
recombinant plasmid is propagated, lysed by sonication, heat, chemical treatment, and the 
homogenate is centrifuged to remove cell debris. The supernatant is then subjected to 
sequential ammonium sulfate precipitation. The fraction containing the polypeptide or 
protein of the present invention is subjected to gel filtration in an appropriately sized dextran 

15 or poly aery lamide column to separate the proteins. If necessary, the protein fraction may be 
further purified by HPLC. 

Once a subject HIV gp41 trimer or C-terminal heptad repeat region of the ectodomain 
of another enveloped virus, has been isolated, it can be used to inhibit viral infection in a 
patient by vaccinating the patient. A patient may be administered an immunogenically 

20 effective amount of a subject stabilized HIV gp41 trimer comprising three gp41 monomers 
that form a trimeric coiled coil, in a prefusogenic conformation. Similarly, a patient may be 
administered an immunogenically effective amount of a C-terminal heptad repeat region of 
the ectodomain of another enveloped virus. By "immunogenically effective amount" it is 
meant an amount which is sufficient to induce protective humoral immunity in the subject. 

25 Such an amount is empirical and depends on factors such as the age, weight, gender, and 

diseased state of the subject. Generally speaking however, a polypeptide concentration in the 
range of from about 1 uM to about 100 mM may be administered. 

Regardless of whether the polypeptides are used for vaccination, treatment or 
diagnosis, they can be administered orally, parenterally, subcutaneously, intravenously, 
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intramuscularly, intraperitoneally, by intranasal instillation, by intracavitary or intravesical 
instillation, intraocularly, intraarterially, intralesionally, or by application to mucous 
membranes, such as, that of the nose, throat, and bronchial tubes. They may be administered 
alone or with pharmaceutically or physiologically acceptable carriers, excipients, or 
5 stabilizers, and can be in solid or liquid form such as, tablets, capsules, powders, solutions, 
suspensions, or emulsions. 

The solid unit dosage forms can be of the conventional type. The solid form can be a 
capsule, such as an ordinary gelatin type containing the antibodies or binding portions thereof 
of the present invention and a carrier, for example, lubricants and inert fillers such as, lactose, 
1 0 sucrose, or cornstarch. In another embodiment, these compounds are tableted with 

conventional tablet bases such as lactose, sucrose, or cornstarch in combination with binders 
like acacia, cornstarch, or gelatin, disintegrating agents, such as cornstarch, potato starch, or 
alginic acid, and a lubricant, like stearic acid or magnesium stearate. 

The polypeptide of the present invention may also be administered in injectable 
15 dosages by solution or suspension of these materials in a physiologically acceptable diluent 
with a pharmaceutical carrier. Such carriers include sterile liquids, such as water and oils, 
with or without the addition of a surfactant and other pharmaceutically and physiologically 
acceptable carrier, including adjuvants, excipients or stabilizers. Illustrative oils are those of 
petroleum, animal, vegetable, or synthetic origin, for example, peanut oil, soybean oil, or 
20 mineral oil. In general, water, saline, aqueous dextrose and related sugar solution, and 
glycols, such as propylene glycol or polyethylene glycol, are preferred liquid carriers, 
particularly for injectable solutions. 

For use as aerosols, the polypeptide of the present invention in solution or suspension 
may be packaged in a pressurized aerosol container together with suitable propellants, for 
25 example, hydrocarbon propellants like propane, butane, or isobutane with conventional 
adjuvants. The materials of the present invention also may be administered in a non- 
pressurized form such as in a nebulizer or atomizer. 

Carrier molecules to which peptides of the invention are covalently linked 
(conjugated) are advantageously, non-toxic, pharmaceutically acceptable and of a size 
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sufficient to produce an immune response in mammals. Examples of suitable carrier 
molecules include tetanus toxoid, keyhole limpet hemocyanin (KLH), and peptides 
corresponding to T cell epitopes (that is, Tl and T2) of the gpl20 envelope glycoprotein that 
can substitute for non-AIDS virus-derived carrier molecules (Cease, Proc. Nat'l. Acad. Sci. 
5 (USA) 84:4249, 1987; Kennedy et al, J. Biol. Chem. 262:5769, 1987). Peptides can also be 
administered with a pharmaceutically acceptable adjuvant, for example, alum, or conjugated 
to other carrier molecules more immunogenic than tetanus toxoid. Linkage of a carrier 
molecule to a peptide of the invention can be direct or through a spacer molecule. Spacer 
molecules are, advantageously, non-toxic and well tolerated. 

10 In addition, the HIV gp41 C-terminal trimer or C-terminal heptad repeat region of the 

ectodomain of another enveloped vims, may be used to generate antibodies against the trimer 
in its inactive conformation. The antibodies can be either polyclonal or monoclonal 
antibodies. These antibodies and binding portions thereof, recognize and bind to a C- 
terminus trimeric coiled coil motif of an HIV gp41 ectodomain or C-terminal heptad repeat 

1 5 region of the ectodomain of another enveloped virus. Monoclonal antibody production may 
be effected by techniques which are well-known in the art. Basically, the process involves 
first obtaining immune cells (lymphocytes) from the spleen of a mammal (e.g., mouse) which 
has been previously immunized with the antigen of interest either in vivo or in vitro. The 
antibody-secreting lymphocytes are then fused with (mouse) myeloma cells or transformed 

20 cells, which are capable of replicating indefinitely in cell culture, thereby producing an 

immortal, immunoglobulin-secreting cell line. The resulting fused cells, or hybridomas, are 
cultured, and the resulting colonies screened for the production of the desired monoclonal 
antibodies. Colonies producing such antibodies are cloned and grown either in vivo or in 
vitro to produce large quantities of antibody. A description of the theoretical basis and 

25 practical methodology of fusing such cells is set forth in Kohler and Milstein, Nature 256:495 
(1975), which is hereby incorporated by reference. 



Mammalian lymphocytes are immunized by in vivo immunization of the animal (e.g., 
a mouse) with the protein or polypeptide of the present invention. Such immunizations are 
repeated as necessary at intervals of up to several weeks to obtain a sufficient titer of 
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antibodies. Following the last antigen boost, the animals are sacrificed and spleen cells 
removed. 

Fusion with mammalian myeloma cells or other fusion partners capable of replicating 
indefinitely in cell culture is effected by standard and well-known techniques, for example, by 
5 using polyethylene glycol ("PEG") or other fusing agents (See Milstein and Kohler, Eur. J. 
Immunol. 6:51 1 (1976), which is hereby incorporated by reference). This immortal cell line, 
which is preferably murine, but may also be derived from cells of other mammalian species, 
including but not limited to rats and humans, is selected to be deficient in enzymes necessary 
for the utilization of certain nutrients, to be capable of rapid growth, and to have good fusion 
10 capability. Many such cell lines are known to those skilled in the art, and others are regularly 
described. 

Procedures for raising polyclonal antibodies are also well known. Typically, such 
antibodies can be raised by administering the protein or polypeptide of the present invention 
subcutaneously to New Zealand white rabbits which have first been bled to obtain pre- 

15 immune serum. The antigens can be injected at a total volume of 100 ml per site at six 
different sites. Each injected material will contain adjuvants with or without pulverized 
acrylamide gel containing the protein or polypeptide after SDS-polyacrylamide gel 
electrophoresis. The rabbits are then bled two weeks after the first injection and periodically 
boosted with the same antigen three times every six weeks. A sample of serum is then 

20 collected 10 days after each boost. Polyclonal antibodies are then recovered from the serum 
by affinity chromatography using the corresponding antigen to capture the antibody. This 
and other procedures for raising polyclonal antibodies are disclosed in E. Harlow, et. al., 
editors, Antibodies: A Laboratory Manual (1988), which is hereby incorporated by reference. 

Human monoclonal antibodies may be readily obtained in humanized mice. Such 
25 mice are commercially available from e.g., Abgenix, Inc., Fremont, CA, and Medarex, Inc., 
Princeton, NJ. Human monoclonal antibodies may also be obtained using phage display 
libraries (Williamson, R.A., et al. 1998 J. Virol. 72:9413-9418). Phage display libraries are 
commercially available from Cambridge Antibody Technology, Cambridge, MA., and 
Biosyte, MD, and may be used in the present invention following the instructions of the 
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manufacturer. Phage display libraries offer the possibility of obtaining panels of monoclonal 
antibodies to multiple putative targets in a very short time frame (one to three months). 

In addition to utilizing whole antibodies, the processes of the present invention 
encompass use of binding portions of such antibodies. Such binding portions include Fab 
5 fragments, F(ab')2 fragments, and Fv fragments. These antibody fragments can be made by 
conventional procedures, such as proteolytic fragmentation procedures, as described in 
Goding, Monoclonal Antibodies: Principles and Practice, pp. 98-1 18, New York: Academic 
Press (1983), which is hereby incorporated by reference. The antibodies or binding portions 
thereof are particularly useful to inhibit HIV (or other enveloped virus) infectivity or for 
1 0 detecting the presence of HIV (or other enveloped virus) in a sample. 

Thus, in accordance with the present invention, there is provided a method of 
inhibiting infectivity of an enveloped virus. The method comprises the steps of contacting 
the enveloped virus with a suitable amount of a subject antibody or binding region thereof 
under conditions effective to inhibit infectivity of the enveloped virus. The antibody or 
15 binding portion thereof may be administered to a host before or following exposure of the 
host to the enveloped virus. 

In a preferred embodiment, there is provided a method of inhibiting HIV infectivity 
which comprises contacting HIV with a suitable amount of a subject antibody or binding 
region thereof under conditions effective to inhibit HIV infectivity. A subject antibody or 
20 binding portion thereof may be administered to a host before or following exposure of the 
host to an infective HIV strain. 

In another aspect of the invention, there is provided a method of detecting HIV or 
other enveloped virus in a sample. The method comprises the steps of exposing a sample to 
an appropriate subject antibody or binding portion thereof, followed by identification of 
25 binding between the enveloped virus and the corresponding antibody or binding portion 
thereof. Thus, in order to detect HIV in a sample, the sample is exposed to a subject HIV 
antibody, hereinbefore described, or a binding portion thereof, and binding between HIV and 
the antibody or binding portion thereof, is identified. In order to detect SIV in a sample, the 
sample is exposed to a subject SIV antibody, hereinbefore described, or binding portion 
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thereof, and binding between SIV and the antibody or binding portion thereof, is identified. 
In order to detect Mo-MLV in a sample, the sample is exposed to a subject Mo-MLV 
antibody, hereinbefore described, or binding portion thereof, and binding between Mo-MLV 
and the antibody or binding portion thereof, is identified. Likewise, in order to detect Ebola 
5 virus in a sample, the sample is exposed to a subject Ebola virus antibody, hereinbefore 
described, or binding portion thereof, and binding between Ebola virus and the antibody or 
binding portion thereof, is identified. 

The present invention also provides a method for screening for drugs which can 
inhibit enveloped virus infection. In accordance with the method, a stabilized viral envelope 
1 0 protein trimer comprising a C-terminal heptad repeat region of the ectodomain in a 

prefusogenic conformation, is exposed to test compounds in order to identify compounds 
which bind thereto. 

In a preferred embodiment, there is provided a method for screening for drugs which 
inhibit HIV infection. In accordance with the method, a gp41 trimer comprising three gp41 
15 monomers that form a trimeric coiled coil in a prefusogenic conformation, is exposed to test 
compounds in order to identify those compounds which bind to the gp41 trimer. In 
particular, the method identifies compounds where gp41, which is bound to the compound, 
binds less effectively to gpl20 than gp41 not bound to the compound. 

Although the invention has been described in detail for the purpose of illustration, it is 
20 understood that such detail is solely for that purpose, and variations can be made therein by 
those skilled in the art without departing from the spirit and scope of the invention. The 
invention is further illustrated by the following examples which are not intended in any way 
to limit the scope of the invention. 
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EXAMPLE 1 

Construction of a chimera of gp41 and GCN4-pII. Formation of native envelope 
glycoprotein trimers on infectious virions is thought to be mediated by a sequence within the 
gp41 ectodomain other than the NH 2 -terminal coiled-coil motif (Fig. 2 A) (10). Trimerization 
5 therefore may therefore depend on a hydrophobic heptad repeat located in the COOH- 

terminal region of the protein. Earlier studies indicate the native structure of the gpl20/gp41 
complex is labile (11), and is transformed by receptor binding into an energetically more 
stable fusogenic conformation (5-7). Experiments were thus designed in order to test whether 
stabilizing this COOH-terminal coiled-coil motif could capture the gp4 1 molecule in the 
10 native metastable fold. Accordingly, the gp41 ectodomain, lacking the NH 2 -terminal fusion 
peptide was engineered, by placing a soluble trimeric isoleucine zipper (GCN4-pII) in heptad 
register to the COOH terminus of the coiled-coil segment replacing the 1 7-residue region 
prior to the transmembrane anchor (Fig. 2B) 

The genes of gp41 (residues 536 to 666, numbered according to their position in 
15 gpl 60 the HXB2 HIV-1 strain) and GCN4-pII (14) were subcloned into the expression vector 
pTMHa [J. P. Staley and P. S. Kim, Protein Science 3, 1822 (1994)] to produce plasmid 
pRgp41pII (Fig. 2B). Met 629 in the pRgp41pII construct was mutated to leucine by single- 
stranded mutagenesis [T. A. Kunkel, J. D. Roberts, R. A. Zakour, Methods Enzymol. 154, 367 
(1987)]. An expression vector (pC45pII) encoding the C45-pII peptide was derived from 
20 pRgp41pII by PCR amplification followed by subcloning into pTMHa. The Kpnl site in 
pC45pII was replaced by the coding sequence for the residues Ala-Ser. Plasmid pC45 was 
derived from pC45pII. Plasmid pC52 was derived from pC45pII by the insertion of the 
appropriate DNA sequences encoding the residues Leu-Trp-Asn-Trp-Phe-Asn-Ile in place of 
GCN4-pII. Plasmid pC29p was derived from pC45pII. Single-letter abbreviations for amino 
25 acid residues are as follows: A, Ala; C, Cys; D, Asp; E, Glu, F, Phe; G, Gly; H, His; I, He; K, 
Lys; L, Leu; M, Met; N, Asn; P, Pro; Q, Gin; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, 
Tyr. 
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In addition, He 573 and Leu 576 were mutated in this chimera molecule to polar serine 
residues in order to destabilize the NH 2 -terminal coiled coil because it forms the centre of 
fusion-active gp41 (6, 12). The resulting chimera protein was designated rgp41-pll. 

The rgp41-pll protein was produced by bacterial expression, purified by reverse-phase 
5 high-performance liquid chromatography, and refolded in vitro. Proteins were expressed in 
the E. coli strain BL21(DE3)/pLysS (Novagen). The chimeric proteins and peptides were 
purified essentially as described for the N36-pl peptide [W. Shu, H. Ji, M. Lu, Biochemistry 
38, 5378 (1999)]. Cys-containing proteins were subjected to air oxidation in 6 M guanidine 
hydrochloride (GdmCl), and 100 mM tris-HCl (pH 8.6) to permit disulfide bond formation 
10 prior to cyanogen bromide treatment. Purified rgp41-pll solubilized in 6 M GdmCl and 50 
mM tris-HCl (pH 8.0) was refolded by dilution into 50 mM tris-HCl (pH 8.0). Identity of the 
peptides was confirmed by mass spectrometry (Perceptive Biosystems Voyager Elite), and all 
molecular masses were found to be within 2 daltons of the expected mass. 

The rgp41-pll protein forms an insoluble aggregate under physiological conditions. 

15 Protease treatment with trypsin resulted in a soluble peptide fragment that extends from Ser 618 
to the COOH-terminal end of the recgp41-pll construct, including the gp41 segment (residues 
618 to 666) on the NH 2 terminus and the entire GCN4 coiled coil on the COOH terminus 
(Fig. 2C). To facilitate further studies, a molecule denoted C45-pII, was constructed in which 
the C45 segment (residues 624 to 668) of gp41 is fused in frame to GCN4-pII (Fig. 3 A). The 

20 circular dichroism (CD) spectrum of C45-pII demonstrates that the peptide is fully helical in 
solution (Fig. 3B). 

EXAMPLE 2 

Alpha-helical structure of the chimera. In an attempt to define a smaller subdomain 
for high-resolution crystallographic studies, C45-pII was subjected to proteolysis by 
25 proteinase K. Trypsin (Worthington) or proteinase K (Boehringer Mannheim) digestions were 
performed with 1 mg/ml protein and 0.01 mg/ml at room temperature in 50 mM tris-HCl (pH 
8.0) and were quenched with 4 mM phenylmethylsulphonyl fluoride (Sigma). The 
proteolytic fragments were separated and purified by reverse-phase high-performance liquid 
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chromatography. The purified fragments were characterized by NH 2 -terminal sequencing and 
mass spectrometry. 

The digestion yields a slightly shorter 56-residue fragment corresponding to residues 
640 to 668 of gp41 and residues 1 to 27 of GCN4-pII. This fragment, designated C29-p, 
5 represents a 1 6-residue truncation from the NH 2 terminus and a 6-residue truncation from the 
COOH terminus. Like the longer C45-pII peptide, C29-p forms a discrete trimer with 100% 
helix content. 

CD spectra were acquired on an Aviv 62DS circular dichroism spectrometer. 
Measurements of [Q] 222 were made at 0°C in 50 mM sodium phosphate (pH 7.0) and 150 mM 

10 NaCl. A [0] 222 value of -33,000 degrees cm 2 dmol" 1 was taken to correspond to 1 00% helix 
[Y. H. Chen, J. T. Yang, K. H. Chau, Biochemistry 13, 3350 (1974)]. Thermal melts were 
performed in the same buffer by measuring [0] 222 as a function of temperature [C. R. Cantor 
and P. R. Schimmel, Biophysical Chemistry (Freeman, New York, 1980), vol. 3, p. 1 132]. 
Peptide concentrations were determined spectrophotometrically in 6 M GdmCl [H. Edelhoch, 

15 Biochemistry 6, 1948 (1967)]. 

Sedimentation equilibrium measurements were carried out on a Beckman XL -A 
analytical ultracentrifuge equipped with an An-60 Ti rotor. All experiments were performed 
in 50 mM sodium phosphate (pH 7.0) and 150 mM NaCl at 4°C and peptide concentrations of 
10, 30, and 100 uM. Data for C45-pII and C29-p showed no systematic dependence of 

20 molecular weight on concentration. At concentrations below 25 uM, the molecular weights 
of the C52 and Q652L peptides varied systematically between monomer and trimer values. 
Data for N656L were not fit to a single-species model [K. E. Van Holde, The Proteins, H. 
Neurath and R. L. Hill, Eds. (Academic Press, San Francisco, 1975), vol. 1, pp. 225] and 
showed systematic dependence of molecular weight on concentration. The molecular weights 

25 derived from the data sets (followed by expected molecular weight for a trimer) and [the rotor 
speeds in thousand revolutions per minute at which data were collected] are as follows: C45- 
pIl-30,342 (28,463) [17, 20]; C29-p-21,748 (20,150) [19, 22]; C52-1 8,028 (19,464) [19, 
22]; Q652L-1 7,708 (19,419) [19, 22]. Sedimentation equilibrium experiments indicate that 
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the C45-pII peptide sediments as a trimeric species (Fig. 3C) (77). Thus, the chimera of the 
gp41 heptad repeat and isoleucine zipper can form a protease-resistant, three-helix structure. 

EXAMPLE 3 

Folding of the C52 coiled-coil domain. In contrast to the C45-pII chimera, the 
5 isolated C45 peptide (residues 624 to 669 of gp41) displays little secondary structure, as 
judged by CD spectroscopy (15). To investigate the possibility that the heptad-repeat motif 
might be stabilized in the context of the longer coiled coil, a 52-residue peptide, designated 
C52, that contains the entire C45 region plus 7 residues on the COOH terminus (Fig. 4 A), 
was studied. The C52 peptide is -70% helical and exhibits a reversible thermal unfolding 
10 transition at 100 uM peptide concentration in neutral pH phosphate-buffered saline (PBS) at 
0°C (Fig. 4B, C). Because the mutant envelope glycoprotein exhibits substantial cell-surface 
expression, gpl60 precursor processing, CD4 binding, gpl20/gp41 association, and soluble 
CD4-induced shedding (i 7), the fusion-defective Asn 656 to Leu mutation is unlikely to alter 
structural features sufficiently to disrupt the native gpl20/gp41 complex. 

1 5 Sedimentation equilibrium measurements indicate that the C52 peptide is a trimer 

between 30 and 100 uM (Fig. 4D). The molecular weight of the C52 peptide systematically 
decreases at concentrations below 25 pM (presumably because of dissociation in the low 
micromolar concentration range). Thus, the C52 peptide forms a labile three-stranded coiled 
coil in isolation. The invariant Trp 670 and Phe 673 residues in positions a and d of the C52 

20 peptide may contribute to the folding free energy of the coiled-coil trimer. 

EXAMPLE 4 

Model for the native conformation of gp41. The finding that the C52 peptide can 
fold into an a-helical trimer leads to a proposal of a model for the native conformation of 
gp41 (see Fig. 5). The COOH-terminal regions of the three gp41 chains form a parallel 
25 trimeric coiled coil that is ~9 residues from the viral membrane end of the molecule. The 

NH 2 -terminal heptad-repeat regions make contacts with the gpl20 subunits, and are thus held 
in an non-coiled-coil conformation (Fig. 5A). The gpl20 chains may also contribute to the 
native structure but primarily form three membrane distal globular domains containing the 
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receptor-binding sites. This native conformation is presumably stabilized by interactions 
between gpl20 and gp41 as well as burial of the apolar fusion peptide in the interior of the 
gpl20/gp41 complex. According to present theories (3, 5), binding of gpl20 to CD4 and a 
chemokine receptor alters the intersubunit interactions and thus initiates cooperative folding 
of the gp41 NH 2 -terminal coiled coil, leading to the exposure of the fusion peptide and its 
insertion into the target membrane. As a result, a transient gp41 intermediate exists as a 
prehairpin molecule simultaneously in both the viral and cellular membranes (Fig. 5B) (7(5). 
At the COOH-terminal end the coiled-coil trimer then refolds into three helices, allowing 
them to pack against the NH 2 -terminal trimeric coiled coil in an inverted orientation 
(Fig. 5C). Thus, the temporal disassembly of the native coiled-coil domain for generating the 
fusogenic hairpin structure serves as a flip-flop mechanism for activating the membrane 
fusion potential of gp41 . 

EXAMPLE 5 

Correlation with genetic studies. In the trimer-of-hairpins structure of fusion-active 
gp41, the conserved Gin 652 and Asn 656 residues at the d and a positions of the outer helix 
respectively are packed into a hydrophobic groove on the surface of the NH 2 -terminal coiled 
coil (<5). Whereas the Asn 656 to Leu mutation abolishes membrane fusion, the Gin 652 to Leu 
mutation increases HIV-1 infectivity (17). To define the basis for these in vivo phenotypes, 
the role of Gin 652 and Asn 656 on coiled-coil interactions was investigated by characterizing 
variants of the C52 peptide: Q652L andN656L. The Q652L peptide was derived from the 
C52 peptide (residues 624-675) by substituting a leucine for glutamine at position 652. The 
N652L peptide was also derived from the C52 peptide by substituting a leucine for asparagine 
at position 656. Both peptides were made by single stranded mutagenesis using the method 
described in Kunkel, T.A., et al., Methods Enzymol. 154, 367 (1987), and produced by 
bacterial expression. On the basis of CD measurements at 100 uM peptide concentration in 
PBS (pH 7.0) at 0°C, Q652L is -70% helical and N656L is fully helical (Fig. 6A). Under 
these conditions, Q652L displays a broad thermal unfolding transition and N656L melts 
cooperatively with a T m of 43 °C (Fig. 6B). The Q652L peptide sediments as a trimer at 
concentrations between 30 and 100 jjM (Fig. 6C). The N656L peptide does not sediment as a 
unique oligomeric species; its apparent molecular weight changes significantly with peptide 
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concentration (Fig. 6C). The Gin 652 to Leu substitution therefore essentially exerts no effects 
on formation of C52 trimers, while the Asn 656 to Leu substitution imparts strong helical 
character at the expense of structural uniqueness. 

It has been previously shown that that leucine is favored (~0.8 kcal mol" 1 ) over Gln 6:>2 
5 at the fusion-active hairpin structure (18). If the stability of the trimeric C52 coiled coil, 
hence the native gpl20/gp41 structure, is assumed to be unaffected by the Gin 652 to Leu 
mutation, then it follows that its energetic preference for the fusogenic conformation provides 
a higher-than-wild-type driving force for membrane fusion. Therefore, increasing the 
stability of the native state by the Asn 656 to Leu mutation prevents the conformational change 
10 and inhibit membrane fusion activity. Thus, the labile nature of the COOH-terminal three- 
stranded coiled coil of gp41 is likely important in facilitating its conformational activation 
required for membrane fusion. 

EXAMPLE 6 

Implications for gp41 function. Results obtained in the preceding examples indicate 
15 that extensive rearrangement and refolding of a-helical coiled coils in the gp41 molecule is 
the basis of the flip-flop molecular model for triggering formation of the fusogenic 
conformation (Figs. 5 A-5B). These changes in conformation indicate that a two-stage 
reaction mechanism is responsible for initiating membrane fusion. In the first, the fusion- 
peptide region is extruded from the native gpl20/gp41 core to the tip of the newly created 
20 NH 2 -terminal trimeric coiled coil, thus allowing its insertion into the bilayer of the target 

membrane (Fig. 5B) (19). In the second, the COOH-terminal coiled-coil trimer dissociates, 
and the resulting three helices relocate with reversal of chain direction and pack against the 
outer surface of the NH,-terminal coiled coil. These two structural changes produce a 
thermodynamically stable, fusogenic hairpin structure (Fig. 5C). Hairpin formation is 
25 thought to bring the viral and cellular membranes together at the fusion site by juxtaposition 
of the membrane anchor and the fusion peptide (7), but the detailed mechanism by which 
membrane apposition leads to the actual coalescence of the two bilayers is unknown. 
Because the trimer-of-hairpins structure is a common feature of diverse viral membrane 



-28- 



fusion proteins (28), this type of activation and refolding could be a general mechanism of 
viral membrane fusion. 

The model of gp41 activation based on results from this study is fully consistent with 
mutagenesis studies. First, site-directed point mutations in the NH 2 -terminal coiled-coil 
5 sequence abolish membrane fusion but have little effect on the oligomerization of the native 
gpl20/gp41 complex (10, 17). Second, the native gp41 structure is thought to be trapped in a 
labile conformation through gpl20-gp41 interactions (4). Consistent with metastable folding 
of gp41 is the observation that the fusion-enhancement Gin 652 to Leu mutation helps tip the 
balance between the labile native and most stable fusogenic states (18). In contrast, the 

10 leucine replacement for the buried Asn 656 residue characterized here may fail to produce a 

sufficient free energy difference between the native and fusogenic folds, thus preventing gp41 
activation. Hence, two types of helix-helix interactions in gp41 enable structural 
rearrangements to take place in a concerted fashion. Third, mutations in gp41 that abrogate 
gpl20 association map to residues in the NH 2 -terminal half of the molecule (17, 21). The 

15 gpl20-gp41 interaction is thought to be responsible for poising the native conformation, with 
gpl20 shedding allowing formation of the trimer-of-hairpins structure upon receptor binding 
(12). These studies demonstrate directly that the NH 2 -terminal heptad-repeat region of gp41 
is indeed an important part of the triggering mechanism for fusion activation. 

EXAMPLE 7 

20 Construction of a chimera of C52 and GCN4-pII. As demonstrated in Example 3, 

a 52-residue peptide (referred to as C52) derived from the COOH-terminal region of the gp41 
ectodomain forms a labile, trimeric coiled coil (Fig. 1) (38). This finding has led to the 
hypothesis that this coiled-coil domain within the native envelope glycoprotein complex 
brings three gp41 molecules into close proximity in order to facilitate their conformational 

25 activation during HIV-1 infection. Thus, the C52 coiled coil appears to be exposed on the 

surface of the infectious virus, raising the possibility that it is a vulnerable target for humoral 
defense and thus a superior immunogen for effective antibody induction. 
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To stabilize this coiled-coil trimer for immunogenicity studies, a peptide was 
constructed in which the 52 residues of the C52 peptide were fused in heptad repeat to the 33 
residues of a trimeric GCN4 coiled coil (GCN4-pII) (Fig. 7A). An expression vector 
(pC52pII) encoding the C52-pII peptide was derived from pC45-pII (38) by the insertion of 
the appropriate DNA sequences encoding the residues LeuTrpAsnTrpPheAsnlle between the 
COOH terminus of C52 and the NH 2 terminus of GCN4-pII. Proteins were expressed in the 
E. coli strain BL21(DE3)/pLysS (Novagen) and purified essentially as described for the N36- 
pl peptide [Shu, W., Ji, H., Lu, M., Biochemistry 38, 5378 (1999)]. The expected molecular 
weights of all concentrations were determined spectrophotometrically in 6M GdmCl 
[Edelhoch, H., Biochemistry 6, 1948(1967]. 

The resulting chimera, C52-pII, was expressed as a recombinant protein in E. coli and 
purified by reverse-phase high-performance liquid chromatography. CD spectra were 
acquired on an Aviv 62DS circular dichroism spectrometer. Measurements of [0] 222 were 
made at 5 uM peptide concentration and 4°C in 50 mM sodium phosphate (pH 7.0) and 150 
mM NaCl. A [G] 222 value of -33,000 degrees cm 2 dmol" 1 was taken to correspond to 1 00% 
helix [Y. H. Chen, J. T. Yang, K. H. Chau, Biochemistry 13, 3350 (1974)]. Thermal melts 
were performed in the same buffer and also with the addition of 4 M GdmCl to facilitate 
unfolding by measuring [0] 222 as a function of temperature [C. R. Cantor and P. R. Schimmel, 
Biophysical Chemistry (Freeman, New York, 1980), vol. 3, p. 1 132]. 

On the basis of circular dichroism measurements, C52-pII is fully helical (Fig. 7C) 
and has a thermal stability that exceeds 100°C. In the presence of the denaturant guanidinium 
hydrochloride (GdmCl) at 4 M concentration, C52-pII melts cooperatively with an apparent 
melting temperature of 89°C (Fig. 7D). 

Sedimentation equilibrium experiments were performed on a Beckman XL-A 
analytical ultracentrifuge equipped with an An-60 Ti rotor. C52-pII solutions were dialyzed 
overnight against 50 mM sodium phosphate (pH 7.0) and 150 mM NaCl, loaded at initial 
concentrations of 5, 15, and 50 uM, and analyzed at 16,000 and 19,000 rpm at 20°C. Data 
were acquired at two wavelengths per rotor speed and processed simultaneously with a 
nonlinear least squares fitting routine [M. L. Johnson, J. J. Correia, D. A. Yphantis, H. R. 
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Halvoson, Biophys. J. 36, 575 (1981)]. Solvent density and protein partial specific volume 
were calculated according to solvent and protein composition, respectively [T. M. Laue, B. D. 
Shah, T. M. Ridgeway, S. L. Pelletier, in Analytical Ultracentrifugation in Biochemistry and 
Polymer Science, S. E. Harding, A. J. Rowe, J. C. Horton, Eds. (Royal Society of Chemistry, 
Cambridge, 1992), pp. 90-125]. The data fit well to a model for an ideal trimer, with no 
systematic deviation of the residuals. The observed molecular weight is 32,100 daltons, 
compared to an expected value of 3 1,400 daltons for a trimer. 

Sedimentation equilibrium measurements indicate that the C52-pII peptide is a 
discrete trimer and exhibits no systematic deviation from trimer molecular weight between 5 
and 50 uM (Fig. 7E). Thus, C52-pII forms a well-structured and extremely stable, three- 
stranded coiled coil. 

Example 8 

Eliciting neutralizing antibodies. To explore the immunogenicity of the COOH- 
terminal coiled-coil region of gp41, the C52-pII, C52, and C45-pII model peptides were used 
to immunize rabbits. To generate antisera, two rabbits each were immunized with the peptide 
C52-pII, C52 or C45-pII. The primary immunization was performed with a dose of 500 ug of 
peptide immunogen emulsified (1:1 mixture) in complete Freund's adjuvant. The animals 
were subsequently challenged with 250, 250, 125, and 125 p.g of the immunogen with 
incomplete Freund's adjuvant for each of four booster immunizations at 3-week intervals. 
20 Sera collected 2 weeks after the third booster inoculation were used for the studies described 
here. Rabbit polyclonal antibodies (total IgG) were purified from serum on a protein A- 
Sepharose column (Amersham Pharmacia Biotech). Total IgG concentrations were 
determined by absorbance at 280 nm (1 optical density = 0.8 mg/ml). The rabbit antibodies 
were generated by Covance Research Products, Inc. (Denver, PA). 

25 The C52 and C45-pII molecules were chosen as controls because the former is 

predominantly unfolded under physiological conditions and the latter lacks seven highly 
conserved hydrophobic amino acids at the COOH-terminus of the gp41 coiled-coil segment, 
relative to C52-pII (38). 
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The purified polyclonal antibodies from the rabbit sera were tested for neutralization 
of a panel of isolates of HIV- 1. Neutralization against primary isolates was assessed using 
PHA-activated PBMC (phytohemagglutinin activated peripheral blood monocytes) as 
indicator cells and determination of p24 antigen production as the endpoint. PBMC were 
5 isolated and stimulated with PHA (5 ng/ml) for 48 hrs followed by PHA and IL-2 (40 U/ml) 
for 72 hrs in RPMI 1640 containing 10% heat-inactivated FBS, 100 U/ml penicillin, 100 
u.g/ml streptomycin and 2 mM L-glutamine. The antibodies were diluted and 50 ul per well 
were pipetted in round bottom microtiter plates, after which an equal volume containing 100 
TCID50 of primary isolate stock was added. The antibody-virus mixture was incubated for 1 

10 hr at 37°C. Next, 100 ul PHA-activated PBMC (5 x 10 5 /ml) were added to each well. The 
calculated neutralization titers refer to the antibody (Ab) concentration present during this 
incubation step. After an overnight incubation, the cells were washed two times with tissue 
culture medium (RPMI). On day 4, 100 ul of the medium was replaced with fresh tissue 
culture medium. On day 7, the cultures were collected and treated with 1% v/v Empigen 

15 (Calbiochem). Triplicate samples were then tested for p24 content using ELISA (Moore, J.P., 
et al., Science 250, 1 139-1 142). In brief, sheep anti-gpl20 Ab D7320 (Aalto Bioreagents) 
was coated overnight on 96- well polystyrene EIA plates (Costar) in 100 mM NaHCOS pH 
8.5. The plates were washed in PBS and p24 was captured from serial dilutions of the HIV-1 
containing samples in PBS/0.1% Empigen. After a 3 hr incubation, unbound p24 was washed 

20 away and bound p24 was detected with alkaline phosphatase-labeled Ab BC1071 

(International Enzymes) diluted 1:3,000 in PBS containing 20% sheep serum and 2% NFDM. 
After a 1 h incubation, the plates were washed and developed with an AMPAK kit (Dako 
Diagnostics) as recommended by the manufacturer. Production of p24 antigen in the Ab- 
containing cultures was compared to p24 production in cultures without Ab run in the same 

25 assay and the Ab concentrations resulting in a 50% and 90% reduction in p24 content were 
determined. 

Results reported herein indicate that stable, trimeric versions of the gp41 COOH- 
terminal heptad-repeat region as a component of combination vaccines against HIV-1 
infection. (Table 1) C52-pII(B) is able to elicit antibodies capable of efficient neutralization 
30 of primary HIV-1 isolates. See Table 1. 
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Table 1. Neutralization of HIV-1 primary isolates by the antiserum C85-pII(B) 



EC50IgG 
(ug/ml) 


HIV-1 


NL4-3 


JRFL 


92US657 


2076cl.3 


HC4 


DH123 


SF162 


C52-pII(B) 


214.03 


135.08 


105.39 


>500 


>500 


215.43 


135.42 


2F5 


15.18 


4.37 


57.83 


38.13 


34.18 






CD4-IgG2 


0.21 


0.51 


>10 


>10 


>10 


5.18 


<0.062 



Given the similarity in structure between C52-pII and C45-pII, and the identity in the 
5 gp41 portion between C52-pII and C52, it appears that a trimeric coiled-coil segment formed 
by the sequence LWNWFNI (amino acids 669 to 675 of gp41) is involved in raising 
neutralizing antibodies. Trp 670 and Trp 672 are completely conserved in 213 fully sequenced M 
group HIV-1 strains, while there is only a single conservative methionine substitution for 
He 675 . In addition, He 669 and Phe 673 occur in 205 and 204 of the 213 sequences respectively, 
10 with only a single nonconservative change present at each position. Even two more variable 
positions (671 and 674) are occupied by single polar and negatively charged residues in more 
than 75 and 60% of the sequences respectively; all the remaining sequences at each position 
possess conservative changes. It appears therefore, that there is selective pressure on the a 
and d positions to maintain trimeric coiled-coil interactions, as well as pressure on the outside 
1 5 heptad positions to preserve particular types of amino acid character. These conserved 
sequence and structural elements likely underlie the ability of C52-pII to elicit broadly 
reactive neutralizing antibodies, thus making it an attractive candidate for vaccine efforts. 

Immediately NH 2 -terminal to the sequence LWNWFNI is an epitope (the linear 
sequence ELDKWA) recognized by the human antibody 2F5 isolated from an HIV-1 -infected 

20 donor (SI). Because the immunizing C52-pII peptide was able to abrogate the fusion 

inhibition activity of the neutralizing antibodies, whereas C52 and C45-pII (both containing 
ELDKWA) were not (Figure 8), it is likely that the discontinuous neutralization determinant 
defined in this study is different from the continuous 2F5 epitope. Consistent with this notion 
is the observation that C85-pIIB serum and 2F5 represent different specificities in 

25 neutralizing seven primary clinical isolates tested (Table 1). Thus, the conserved, 

conformation-specific epitope in the C52-pII 'tailor-made' molecule appears to produce more 
effective antibody responses than a natural HIV-1 infection. 
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